class: center, middle, inverse, title-slide .title[ # The What and the Why of Statistics ] .subtitle[ ## EDP 613 ] .author[ ### Week 1 ] --- <script> function resizeIframe(obj) { obj.style.height = obj.contentWindow.document.body.scrollHeight + 'px'; } </script>
# Basic Ideas >- **Parameter** - A number describing an entire *population* -- >- **Statistic** - A number describing a slice, or a *sample* of a *population* --- # Polar Views of the World **Frequentist Statisticians** believe that there is one and only one correct parameter that can be found by using multiple samples. - parameters are fixed and data vary - there is a single truth that can be found with enough indicators - only objectibity can be used -- **Bayesian Statisticians** believe that multiple parameters exist which are all based on varying probabilities. - parameters vary and the data is fixed - there are multiple truths and getting to any one is based on chance - subjectivity is a built feature --- # Learning Statistics - You likely do not know enough about probability so for now assume that the frequentist point-of-view is correct. -- - It is easier to begin to learn statistics if you don't have to consider multiple outcomes in superposition. -- - We will come back to the Bayesian vs. Frequentist arguement --- # Start **Descriptive Statistics** - Mathematical techniques for organizing and summarizing a set of numerical data -- # Finish **Inferential Statistics** - Generalizing from a sample to a population --- # Definitions - Information is collected on *elements* or *individuals* -- - The characteristics of the individuals about which we collect information are called *variables* -- - The values of the variables that we obtain are called *data* --- # Overarching Types of Data - *Qualitative variables* (aka *categorical variables*) classify elements into categories. -- - *Quantitative variables* tell how much or how many of something there is. --- # Example Which of the following variables are qualitative and which are quantitative? <center> <table class="table" style="width: auto !important; margin-left: auto; margin-right: auto;"> <thead> <tr> <th style="text-align:center;vertical-align: middle !important;background-color: #212121 !important;"> </th> <th style="text-align:left;vertical-align: middle !important;background-color: #212121 !important;"> Situation </th> <th style="text-align:left;vertical-align: middle !important;background-color: #212121 !important;"> Type </th> </tr> </thead> <tbody> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> 1 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;"> The name of the schools in your district. </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;"> </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;background-color: #212121 !important;"> 2 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;background-color: #212121 !important;"> The number of schools in your district. </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;background-color: #212121 !important;"> </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> 3 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;"> The amount of each ingredient in a cake. </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;"> </td> </tr> <tr> <td style="text-align:center;width: 5em; background-color: #212121 !important;"> 4 </td> <td style="text-align:left;width: 40em; background-color: #212121 !important;"> The ingredients in a cake. </td> <td style="text-align:left;width: 20em; background-color: #212121 !important;"> </td> </tr> </tbody> </table> </center> --- # Solution <center> <table class="table" style="width: auto !important; margin-left: auto; margin-right: auto;"> <thead> <tr> <th style="text-align:center;vertical-align: middle !important;background-color: #212121 !important;"> </th> <th style="text-align:left;vertical-align: middle !important;background-color: #212121 !important;"> Situation </th> <th style="text-align:left;vertical-align: middle !important;background-color: #212121 !important;"> Type </th> </tr> </thead> <tbody> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> 1 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;"> The name of the schools in your district. </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;"> Qualitative </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;background-color: #212121 !important;"> 2 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;background-color: #212121 !important;"> The number of schools in your district. </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;background-color: #212121 !important;"> Quantitative </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> 3 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;"> The amount of each ingredient in a cake. </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;"> Quantitative </td> </tr> <tr> <td style="text-align:center;width: 5em; background-color: #212121 !important;"> 4 </td> <td style="text-align:left;width: 40em; background-color: #212121 !important;"> The ingredients in a cake. </td> <td style="text-align:left;width: 20em; background-color: #212121 !important;"> Qualitative </td> </tr> </tbody> </table> </center> --- # Levels of Measurement <center> <table class="table" style="width: auto !important; margin-left: auto; margin-right: auto;"> <thead> <tr> <th style="text-align:center;vertical-align: middle !important;background-color: #212121 !important;"> </th> <th style="text-align:center;vertical-align: middle !important;background-color: #212121 !important;"> Nominal </th> <th style="text-align:center;vertical-align: middle !important;background-color: #212121 !important;"> Ordinal </th> <th style="text-align:center;vertical-align: middle !important;background-color: #212121 !important;"> Interval </th> <th style="text-align:center;vertical-align: middle !important;background-color: #212121 !important;"> Ratio </th> </tr> </thead> <tbody> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> Naming, labeling, or classifying observations </td> <td style="text-align:center;width: 20em; vertical-align: middle !important;"> <svg aria-hidden="true" role="img" viewbox="0 0 448 512" style="height:1em;width:0.88em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:#ffffff;overflow:visible;position:relative;"><path d="M438.6 105.4C451.1 117.9 451.1 138.1 438.6 150.6L182.6 406.6C170.1 419.1 149.9 419.1 137.4 406.6L9.372 278.6C-3.124 266.1-3.124 245.9 9.372 233.4C21.87 220.9 42.13 220.9 54.63 233.4L159.1 338.7L393.4 105.4C405.9 92.88 426.1 92.88 438.6 105.4H438.6z"></path></svg> </td> <td style="text-align:center;width: 20em; vertical-align: middle !important;"> <svg aria-hidden="true" role="img" viewbox="0 0 448 512" style="height:1em;width:0.88em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:#ffffff;overflow:visible;position:relative;"><path d="M438.6 105.4C451.1 117.9 451.1 138.1 438.6 150.6L182.6 406.6C170.1 419.1 149.9 419.1 137.4 406.6L9.372 278.6C-3.124 266.1-3.124 245.9 9.372 233.4C21.87 220.9 42.13 220.9 54.63 233.4L159.1 338.7L393.4 105.4C405.9 92.88 426.1 92.88 438.6 105.4H438.6z"></path></svg> </td> <td style="text-align:center;width: 20em; vertical-align: middle !important;"> <svg aria-hidden="true" role="img" viewbox="0 0 448 512" style="height:1em;width:0.88em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:#ffffff;overflow:visible;position:relative;"><path d="M438.6 105.4C451.1 117.9 451.1 138.1 438.6 150.6L182.6 406.6C170.1 419.1 149.9 419.1 137.4 406.6L9.372 278.6C-3.124 266.1-3.124 245.9 9.372 233.4C21.87 220.9 42.13 220.9 54.63 233.4L159.1 338.7L393.4 105.4C405.9 92.88 426.1 92.88 438.6 105.4H438.6z"></path></svg> </td> <td style="text-align:center;width: 20em; vertical-align: middle !important;"> <svg aria-hidden="true" role="img" viewbox="0 0 448 512" style="height:1em;width:0.88em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:#ffffff;overflow:visible;position:relative;"><path d="M438.6 105.4C451.1 117.9 451.1 138.1 438.6 150.6L182.6 406.6C170.1 419.1 149.9 419.1 137.4 406.6L9.372 278.6C-3.124 266.1-3.124 245.9 9.372 233.4C21.87 220.9 42.13 220.9 54.63 233.4L159.1 338.7L393.4 105.4C405.9 92.88 426.1 92.88 438.6 105.4H438.6z"></path></svg> </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;background-color: #212121 !important;"> Ranks categories in order </td> <td style="text-align:center;width: 20em; vertical-align: middle !important;background-color: #212121 !important;"> </td> <td style="text-align:center;width: 20em; vertical-align: middle !important;background-color: #212121 !important;"> <svg aria-hidden="true" role="img" viewbox="0 0 448 512" style="height:1em;width:0.88em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:#ffffff;overflow:visible;position:relative;"><path d="M438.6 105.4C451.1 117.9 451.1 138.1 438.6 150.6L182.6 406.6C170.1 419.1 149.9 419.1 137.4 406.6L9.372 278.6C-3.124 266.1-3.124 245.9 9.372 233.4C21.87 220.9 42.13 220.9 54.63 233.4L159.1 338.7L393.4 105.4C405.9 92.88 426.1 92.88 438.6 105.4H438.6z"></path></svg> </td> <td style="text-align:center;width: 20em; vertical-align: middle !important;background-color: #212121 !important;"> <svg aria-hidden="true" role="img" viewbox="0 0 448 512" style="height:1em;width:0.88em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:#ffffff;overflow:visible;position:relative;"><path d="M438.6 105.4C451.1 117.9 451.1 138.1 438.6 150.6L182.6 406.6C170.1 419.1 149.9 419.1 137.4 406.6L9.372 278.6C-3.124 266.1-3.124 245.9 9.372 233.4C21.87 220.9 42.13 220.9 54.63 233.4L159.1 338.7L393.4 105.4C405.9 92.88 426.1 92.88 438.6 105.4H438.6z"></path></svg> </td> <td style="text-align:center;width: 20em; vertical-align: middle !important;background-color: #212121 !important;"> <svg aria-hidden="true" role="img" viewbox="0 0 448 512" style="height:1em;width:0.88em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:#ffffff;overflow:visible;position:relative;"><path d="M438.6 105.4C451.1 117.9 451.1 138.1 438.6 150.6L182.6 406.6C170.1 419.1 149.9 419.1 137.4 406.6L9.372 278.6C-3.124 266.1-3.124 245.9 9.372 233.4C21.87 220.9 42.13 220.9 54.63 233.4L159.1 338.7L393.4 105.4C405.9 92.88 426.1 92.88 438.6 105.4H438.6z"></path></svg> </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> Known equal intervals </td> <td style="text-align:center;width: 20em; vertical-align: middle !important;"> </td> <td style="text-align:center;width: 20em; vertical-align: middle !important;"> </td> <td style="text-align:center;width: 20em; vertical-align: middle !important;"> <svg aria-hidden="true" role="img" viewbox="0 0 448 512" style="height:1em;width:0.88em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:#ffffff;overflow:visible;position:relative;"><path d="M438.6 105.4C451.1 117.9 451.1 138.1 438.6 150.6L182.6 406.6C170.1 419.1 149.9 419.1 137.4 406.6L9.372 278.6C-3.124 266.1-3.124 245.9 9.372 233.4C21.87 220.9 42.13 220.9 54.63 233.4L159.1 338.7L393.4 105.4C405.9 92.88 426.1 92.88 438.6 105.4H438.6z"></path></svg> </td> <td style="text-align:center;width: 20em; vertical-align: middle !important;"> <svg aria-hidden="true" role="img" viewbox="0 0 448 512" style="height:1em;width:0.88em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:#ffffff;overflow:visible;position:relative;"><path d="M438.6 105.4C451.1 117.9 451.1 138.1 438.6 150.6L182.6 406.6C170.1 419.1 149.9 419.1 137.4 406.6L9.372 278.6C-3.124 266.1-3.124 245.9 9.372 233.4C21.87 220.9 42.13 220.9 54.63 233.4L159.1 338.7L393.4 105.4C405.9 92.88 426.1 92.88 438.6 105.4H438.6z"></path></svg> </td> </tr> <tr> <td style="text-align:center;width: 5em; background-color: #212121 !important;"> Includes a natural zero point </td> <td style="text-align:center;width: 20em; background-color: #212121 !important;"> </td> <td style="text-align:center;width: 20em; background-color: #212121 !important;"> </td> <td style="text-align:center;width: 20em; background-color: #212121 !important;"> </td> <td style="text-align:center;width: 20em; background-color: #212121 !important;"> <svg aria-hidden="true" role="img" viewbox="0 0 448 512" style="height:1em;width:0.88em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:#ffffff;overflow:visible;position:relative;"><path d="M438.6 105.4C451.1 117.9 451.1 138.1 438.6 150.6L182.6 406.6C170.1 419.1 149.9 419.1 137.4 406.6L9.372 278.6C-3.124 266.1-3.124 245.9 9.372 233.4C21.87 220.9 42.13 220.9 54.63 233.4L159.1 338.7L393.4 105.4C405.9 92.88 426.1 92.88 438.6 105.4H438.6z"></path></svg> </td> </tr> </tbody> </table> </center> --- # Note Your textbook pools interval and ratio together as *interval-ratio*. --- # Example <center> <table class="table" style="width: auto !important; margin-left: auto; margin-right: auto;"> <thead> <tr> <th style="text-align:center;vertical-align: middle !important;background-color: #212121 !important;"> </th> <th style="text-align:left;vertical-align: middle !important;background-color: #212121 !important;"> Situation </th> <th style="text-align:left;vertical-align: middle !important;background-color: #212121 !important;"> Type </th> </tr> </thead> <tbody> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> 1 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;"> The (typical) letter grade distribution in a school </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;"> </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;background-color: #212121 !important;"> 2 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;background-color: #212121 !important;"> Toppings on a cheeseburger </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;background-color: #212121 !important;"> </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> 3 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;"> Social economic status </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;"> </td> </tr> <tr> <td style="text-align:center;width: 5em; background-color: #212121 !important;"> 4 </td> <td style="text-align:left;width: 40em; background-color: #212121 !important;"> A telephone number </td> <td style="text-align:left;width: 20em; background-color: #212121 !important;"> </td> </tr> <tr> <td style="text-align:center;width: 5em; "> 5 </td> <td style="text-align:left;width: 40em; "> Time </td> <td style="text-align:left;width: 20em; "> </td> </tr> </tbody> </table> </center> --- # Solution <center> <table class="table" style="width: auto !important; margin-left: auto; margin-right: auto;"> <thead> <tr> <th style="text-align:center;vertical-align: middle !important;background-color: #212121 !important;"> </th> <th style="text-align:left;vertical-align: middle !important;background-color: #212121 !important;"> Situation </th> <th style="text-align:left;vertical-align: middle !important;background-color: #212121 !important;"> Type </th> </tr> </thead> <tbody> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> 1 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;"> The (typical) letter grade distribution in a school </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;"> Ordinal </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;background-color: #212121 !important;"> 2 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;background-color: #212121 !important;"> Toppings on a cheeseburger </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;background-color: #212121 !important;"> Nominal </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> 3 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;"> Social economic status </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;"> Ordinal </td> </tr> <tr> <td style="text-align:center;width: 5em; background-color: #212121 !important;"> 4 </td> <td style="text-align:left;width: 40em; background-color: #212121 !important;"> A telephone number </td> <td style="text-align:left;width: 20em; background-color: #212121 !important;"> Ordinal </td> </tr> <tr> <td style="text-align:center;width: 5em; "> 5 </td> <td style="text-align:left;width: 40em; "> Time </td> <td style="text-align:left;width: 20em; "> Interval Ratio </td> </tr> </tbody> </table> </center> --- # Discrete and Continuous - *Discrete variables* are quantitative variables whose possible values can be listed - possibly infinite - obtained by counting - *Continuous variables* are quantitative variables that can take on any value in some interval. - possibly infinite - obtained by measuring --- # Example Which of the following variables are discrete or continuous? <center> <table class="table" style="width: auto !important; margin-left: auto; margin-right: auto;"> <thead> <tr> <th style="text-align:center;vertical-align: middle !important;background-color: #212121 !important;"> </th> <th style="text-align:left;vertical-align: middle !important;background-color: #212121 !important;"> Situation </th> <th style="text-align:left;vertical-align: middle !important;background-color: #212121 !important;"> Type </th> </tr> </thead> <tbody> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> 1 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;"> Time it takes to get to school </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;"> </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;background-color: #212121 !important;"> 2 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;background-color: #212121 !important;"> Water temperature </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;background-color: #212121 !important;"> </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> 3 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;"> Ratings on a 5-point rating scale </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;"> </td> </tr> <tr> <td style="text-align:center;width: 5em; background-color: #212121 !important;"> 4 </td> <td style="text-align:left;width: 40em; background-color: #212121 !important;"> Number of cars currently in a parking lot </td> <td style="text-align:left;width: 20em; background-color: #212121 !important;"> </td> </tr> </tbody> </table> </center> --- # Solution <center> <table class="table" style="width: auto !important; margin-left: auto; margin-right: auto;"> <thead> <tr> <th style="text-align:center;vertical-align: middle !important;background-color: #212121 !important;"> </th> <th style="text-align:left;vertical-align: middle !important;background-color: #212121 !important;"> Situation </th> <th style="text-align:left;vertical-align: middle !important;background-color: #212121 !important;"> Type </th> </tr> </thead> <tbody> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> 1 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;"> Time it takes to get to school </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;"> Continuous </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;background-color: #212121 !important;"> 2 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;background-color: #212121 !important;"> Water temperature </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;background-color: #212121 !important;"> Continuous </td> </tr> <tr> <td style="text-align:center;width: 5em; vertical-align: middle !important;"> 3 </td> <td style="text-align:left;width: 40em; vertical-align: middle !important;"> Ratings on a 5-point rating scale </td> <td style="text-align:left;width: 20em; vertical-align: middle !important;"> Discrete </td> </tr> <tr> <td style="text-align:center;width: 5em; background-color: #212121 !important;"> 4 </td> <td style="text-align:left;width: 40em; background-color: #212121 !important;"> Number of cars currently in a parking lot </td> <td style="text-align:left;width: 20em; background-color: #212121 !important;"> Discrete </td> </tr> </tbody> </table> </center> --- ## That's it. Take a break before our R session!